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Wavefunction collapse via a nonlocal 
relativistic variational principle 



Abstract We propose, as an alternative theory of quantum mechanics, a relativis- 
tically covariant variational principle (VP) capable of describing both wavefunc- 
tion collapse and, as an appropriate limiting case, evolution of the wavefunction 
according to the standard quantum mechanical (SQM) wave equation. This re- 
sults in a nonlinear, nonlocal, time-symmetric hidden-variable theory; the hidden 
variable is the phase of the wavefunction, which affects the dynamics via zitterbe- 
wegung. 

The VP is 8(A \ + £^2) = 0, in which A 1 and A 2 are positive definite integrals 
(over all spacetime) of functions of the wavefunction !//"(?, x). A\ is quadratic in 
deviations of the wavefunction from compliance with the SQM wave equation. A2 
is a measure of the uncertainty of the wavefunction, driving collapse by penalizing 
certain kinds of superpositions. We also show that A\ limits the rate of collapse, 
and that it enforces the Born rule, with suitable assumptions and approximations. 

Since the VP optimizes a function \ff of both space and time, the theory is 
not "causal" in the usual sense. Because it is not clear how Nature solves the 
optimization problem (e.g., whether a global or a local minimum is sought), we 
cannot yet say whether it is deterministic. 

Keywords Quantum foundations ■ Quantum nonlocality 
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1 Introduction 

Although the standard theory of quantum mechanics (SQM, for short) has been re- 
markably successful for many years, foundational and interpretational issues that 
troubled some of its founders (see e.g. H]) are by no means resolved. [2,3,4] For 
instance, the wave equation is a time-symmetric, deterministic, linear differential 
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equation. On the other hand, the collapse process is not expressible as an evolu- 
tion equation; the only prediction that this element of the theory can make is of 
the probabilities of the various possible outcomes in an ensemble of identically 
prepared experiments. The collapse mechanism is apparently unique, or almost 
so, among physical theories both because it is intrinsically random, and because 
it is asymmetric in time (separating a superposition of states in the past from a 
single state in the future). In addition, the regimes of validity of the wave equation 
and the collapse process are defined in terms of whether or not a "measurement" 
is being performed, but that term (and the related term "observer") are not defined 
with the degree of clarity we expect for such fundamental concepts. 0I6) 

The failure of SQM to give a detailed description of the collapse process is 
typically regarded as a conundrum for those who seek intuitive understanding, but 
not a practical weakness because it does not hinder calculations. However, one can 
design a set of thought experiments exhibiting the ambiguity of the theory. We can 
imagine an infinite set of experiments {E(X)} in which all the input parameters 
of the experiment are continuous functions of X, with < X < 1, such that E(0) 
is clearly not a measurement and E(l) is. For instance, if the measurement E(l) 
requires turning on some probing electromagnetic field F(t), then E(X) might be 
defined as the experiment conducted in the same way but using the field XF(t). 
SQM would say thatis(O) does not collapse the wavefunction and E{\) does — so 
at what value of X does the collapse first take place, and how can one justify the 
abrupt transition from no collapse to collapse? 

Because the results of the collapse process appear random — experiments with 
apparently identical initial conditions are observed to give different results — it is 
natural to suspect that a "hidden variable" is involved, so that such a set of mea- 
surements in fact have different initial conditions. Then, presumably, when the 
hidden variable is accounted for, experiments that are indeed identically prepared 
will yield identical results after all. This promising resolution encounters an obsta- 
cle in Bell's Theorem [7| and relations equivalent to it [8|, because that theorem 
implies that phenomena violating those inequalities cannot be explained by local 
hidden- variable theories. 

Because a great many alternative interpretations and theories of quantum me- 
chanics have appeared in recent years, we will mention just a few of the most 
successful. 

Cramer [9 1 has developed a "transactional interpretation" of QM that involves 
two-way "transactions" between lightlike separated points in spacetime. The spa- 
tial and temporal nonlocality of this approach enable it to explain nonlocal phe- 
nomena more plausibly than SQM can, in this author's opinion. However, since 
this is an interpretation and not a modification of the theory, the wave equation is 
unchanged, and the theory is still unable to quantitatively describe the transitions 
between measurement and non-measurement regimes (such as the dependence on 
X in the thought experiments described above). Cramer's interpretation may turn 
out to be consistent with the theory we will describe below, but we assert that it 
does not go far enough by itself. 

Griffiths' "consistent histories" interpretation H10llll|[T2l identifies time-ordered 
sets of mutually consistent events as "histories." It enables the inference of proba- 
bilistic statements about unmeasured quantities, under appropriate circumstances. 
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This is a very different approach than we have followed. Like Cramer's work, it is 
a re-interpretation but not a correction of the theory. 

The collapse theories of Pearle, Ghirardi, Rimini and Weber IU3II14|[T5 1 pro- 
pose that an external source of noise ("hittings") acts on the wavefunction in a 
way that produces the observed variety of experimental outcomes from a single 
initial condition. This results in some desirable properties, such as a much more 
rapid collapse for macroscopic objects than for very small systems. Our princi- 
pal objection to this type of theory is that the external noise source lacks physical 
justification. 

The decoherence explanation lll6lll7l[T8l[T9l also relies on an external source 
of noise, in this case, the environment surrounding the system and the measure- 
ment apparatus. According to this view, unavoidable entanglement of these three 
regimes leads to the result that, when the system and apparatus are measured and 
the environment ignored, the system appears to have collapsed into a single state. 
Although some questions have remained about this explanation, (20) it is among 
the most successful attempts to explain the collapse process. 

As an alternative to relying on an external source of noise, and in search of a 
conceptually more compact theory, we will here consider a model which depends 
only on properties of the system under study, possibly including the measurement 
apparatus with which it is entangled. This of course constitutes a hidden-variable 
theory; by the considerations mentioned above, it must be a nonlocal theory so as 
not to violate the constraints of Bell's Theorem and the associated experimental 
findings. Although this program will lead to some unusual assumptions about the 
operation of quantum mechanical systems in space and time, we will find that it 
holds great promise to explain issues and observations that are troubling at best 
for other interpretations of quantum mechanics. 

2 Requirements to be satisfied by the theory 

2.1 Constraints 

We intend the new theory to do the following things: 

1 . Describe matter waves themselves, and not principally the knowledge a human 
being actually or potentially has about the system under study. 

2. Describe wavefunction collapse (state reduction) in a natural wayQ 

3. Apply in the same way — that is, by solving the same equations — whether or 
not a measurement is being made (and thus not depend on the presence of an 
observer or the precise definition of "measurement" [|5]). 

4. Agree with the SQM wave equation under conditions in which the latter should 
hold (that is, when a "measurement" is not being made). 

5. Predict experimental outcomes distributed according to the Born rule (for mea- 
surements typical of the body of experiments that have been done; we will 
explain that caveat presently). 

1 We do not require collapse to be instantaneous. Under SQM, a decay time At less than or 
of the order of 1 j AE is considered indistinguishable from zero, so the experimental evidence — 
which has been interpreted according to SQM — cannot rule out nonzero collapse durations sat- 
isfying that inequality. We therefore regard the term decay as more accurate than collapse, but 
will use the latter term in conformity with accepted nomenclature. 
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6. Depend on a hidden variable or variables (allowable due to characteristic 11, 
below), rather than an external "noise" source unrelated to the system under 
study, to break the symmetry among possible experimental outcomes. 

7. Be deterministic (like other fundamental laws). 

8. Be time-symmetric (like other fundamental laws). 

9. Apply in the relativistic domain (and thus be expressible in covariant form). 
This requirement might seem excessive, because the issues we essay to ad- 
dress here are manifested in nonrelativistic SQM. However, we will propose 
below that the hidden variable central to the choice of outcomes of the collapse 
process is the phase of zitterbewegung oscillations, for which the simplest de- 
scription is relativistic. Therefore we find it necessary to construct a relativistic 
theory to explain experimental observations that, in every other way, are com- 
pletely confined to the nonrelativistic domain. We are of course hopeful that 
the relativistic theory outlined below will apply as well to explain fully rela- 
tivistic phenomena (although in this initial work we have stopped short of a 
field-theoretic analysis, which may well be needed at some point). But for the 
time being, to keep the scope of this paper somewhat bounded, we will focus 
our attention on measurements and gedanken experiments at low energies and 
very subluminal speeds. 

10. Be nonlinear. This follows from conditions 2 and 3, which mean that the col- 
lapse must follow from the fundamental formulation (e.g., wave equation) of 
the theory, without the need for auxiliary constructions or variables (like de 
Broglie-Bohm pilot waves H21U221 ). 

11. Be nonlocal in space both to be able to distinguish pure eigenstates from su- 
perpositions, and to avoid the Bell's Theorem [7 1 prohibition on local hidden- 
variable theories (cf. condition 6). [23 1 

12. Be nonlocal in time. This is implied by conditions 1 1 and 9, because observers 
in different reference frames must agree on whether the wave function at a 
spacetime point A depends on conditions at B, but they may not agree on 
whether A and B have the same time coordinate. 

13. Cross over smoothly and naturally from microscopic to macroscopic behav- 
ior, dispensing with the need for two sets of rules in regimes separated by a 
boundary (the "shifty split" 0). 



Obviously, we must explain how a new theory that differs from SQM can be 
consistent with the experimental record, which is generally understood as being 
consistent with SQM. Therefore, in pursuit of that explanation, discussions below 
of experimental and measurement processes are meant to refer to processes and 
technologies that are likely to have been employed up to the present time, unless 
otherwise indicated. For instance, the experimental record is generally understood 
to be consistent with the Born rule, and the theory to be presented below will 
explain that in terms of standard experimental techniques of the past and present, 
but will not rule out the possibility that future experiments may contradict the 
Born rule by detecting or manipulating the hidden variable, or by using technology 
not generally available up to now. [This is the reason for the caveat appended to 
constraint 5 above.] 
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2.2 Phase as the hidden variable 

We propose that the hidden variable is the phase of the wavefunction at some 
time relative to the measurement (this choice was explored by Pearle [24|) — or 
equivalently, the start time of the experiment relative to the oscillations of the 
wavefunction. Suppose we write an energy eigenstate j as 

(We will takec = /z = 1.) 

If iff is a superposition of states with different energies 



then there exist operators 6 such that the combination ff\jf contains cross terms 
("beats") that oscillate in time (23. For instance, if 6 commutes with functions 
of time, 



jjk 



(2.1) 



Experimental detection and measurement of the wavefunction require it to be 
localized, at some stage of the experiment, to within a spatial extent comparable 
to the size of the laboratory. This means that the localized wavepacket includes 
contributions from negative-energy modes [26 27 28 1, so the sum (12.1b includes 
terms (zitterbewegung) for which E; and Ej< have opposite signs. 

Now suppose that an experiment is begun at some time and that an ex- 
perimental result is read at some time T after the beginning of the experiment. 
Substituting 

t = A + T (2.2) 

into J2.lt gives an expression that varies sinusoidally with with a period — 
2n/(Ej — Ejc) in each term of the sum. The experimenter typically controls T but 
would not attempt to control f;, both because he/she is ignorant of the initial phase 
of the wavefunction and because, for the zitterbewegung terms, will not sig- 
nificantly exceed 

7t/m^4x 10~ 21 seconds for electrons , (2.3) 

so control of f; to within a fraction of is not currently attainable. Therefore a set 
of "identically prepared" experiments would be expected to have different values 
of t\ and hence different outcomes. For this reason we can regard ?; as the hidden 
variable. To very good approximation, we regard it as a random variable chosen 
from a uniform distribution on [0, T] for some T ^> Tj^Mj^k. 

Since the zitterbewegung terms in l//^ (?y are likely to vary from one exper- 
imental realization to another, for the reasons just given, we propose that they 
determine the outcome of the measurement. Since nonrelativistic systems do not 

2 Actually, the "reading" of the result is likely to be a process that takes place over a range of 
times, not a single instant, but that additional complication does not affect the conclusion that 
we will reach. 
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contain appreciable contributions from negative-energy states, the zitterbewegung 
terms are small in such systems. It is natural to ask whether they are large enough 
to drive wavefunction decay, or at least to determine the outcome of such decay. 

Consider a nonrelativistic wavepacket localized in space, which therefore con- 
tains some negative-energy components. If it is confined to a size Ax, then its 
representation as a superposition of plane waves must include contributions from 
modes with momenta up to Ap ~ 1 /Ax, and therefore the negative-energy states 
have an amplitude C; of order Ap/m~ 1 jmAx relative to the dominant positive- 
energy states. 11261 

Now we would expect one lower bound on the time required for a measure- 
ment to be At ~ Ax. If the zitterbewegung terms are to drive the wavefunction 
decay, their strength must therefore scale inversely with the size of the system, in 
order for the decay to be essentially complete on an experimental timescale. But 
that is what we have just shown. 

Thus we expect that even in nonrelativistic systems, zitterbewegung terms 
drive the decay to a single state when a measurement is made. Therefore the theory 
we seek must be a relativistic one. 



3 Formulation of the variational principle 

Because some of the constraints we hope to satisfy (decay to a single state, Born 
rule) are stated in terms of the eigenstates of the measured variable, as defined in 
SQM, and because we expect SQM to emerge as a special case of our theory, we 
will develop it in terms of the equations of SQM and their solutions. 

We will focus on nonrelativistic systems to develop the theory, even though we 
have found that a relativistic theory will be needed for that task. (We would like 
this theory to apply to relativistic systems as well, but there are sufficiently many 
interesting problems at low energies that we can focus our attention on them for 
the time being.) Thus we will consider only cases for which the energy changes 
and differences due to the experimental process (the imposed fields), and the rates 
of change of those energies and those fields, are small compared to the rest energy 
mc 2 of the particle. In this paper we limit consideration to fermions as the system 
to be described, so the relevant SQM wave equation is the Dirac equation II26II27I 

EH 

@Y = 0, (3.1) 

where S = ft/m— 1. 

If we use the representation 

^(J-U ^=(-<o) 

of the y matrices, then positive-energy wavefunctions of momentum p <^m look 
like 

/O(l) 
0(1) 



o 

Vo(£ 



(3.3) 
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and negative-energy wavefunctions like 



Iffc 




(3.4) 



3.1 Terms in the variational principle 

As explained above, the desired theory must be nonlocal in space and time; there- 
fore a natural mathematical form would includes (an) integral(s) over spacetime. 
Thus the wave equation must be an integral or integrodifferential equation. This 
suggests in turn that we formulate the theory as a variational principle (VP). 

We propose as such a principle that nature seeks to minimize the sum of two 
positive definite terms: one that vanishes when the wavefunction is a solution of 
the SQM wave equation, and so tends to drive the wavefunction toward such so- 
lutions; and one that increases with the uncertainty in the wavefunction, thus fa- 
voring minimum uncertainty states 0291 . 

A superposition of eigenstates of the operator corresponding to the measured 
property will generally have more than the minimum uncertainty. Therefore the 
tendency toward minimum uncertainty states will tend to cause such superpo- 
sitions to decay, as we expect when measurements are made. This idea will be 
illustrated below, in the example calculation of the two-slit experiment. 

The desired VP takes the form 

<5(Ai+eA 2 )=0 (3.5) 

in terms of positive definite functionals A i and A 2 of the wavefunction and a posi- 
tive dimensionless constant £. A\ measures the deviation of the wavefunction from 
a solution of the SQM wave equation, so minimizing it drives the wavefunction to 
obey that equation. We will see that A 1 plays two other roles: it forces the collapse 
to take place over a period of time rather than instantaneously, and it enforces the 
Born rule. 

The second term, A2, is a measure of the position-momentum (and energy- 
time) uncertainty of the wavepacket. We propose that under conditions typical of 
a measurement, minimizing this term drives reduction of the wavefunction to a 
single eigenstate of the operator corresponding to the quantity being measured. 
Since A2 must somehow select a state pertaining to that particular operator, we 
must include the measurement apparatus (or some part of it that is entangled with 
the system being measured) in the wavefunction that appears in the VP; then the 
tendency of A2 to minimize wavefunction uncertainty will make it unlikely for a 
measurement involving a macroscopic apparatus to end up in a superposition of 
macroscopic ("pointer") states. 

For an example in which part of the measurement apparatus is included in a VP 
calculation, see this author's calculation [30] of the electron two-slit experiment. 

The positive dimensionless number £ in the variational principle (13.51 allows 
us to adjust the relative sizes of its terms. We shall take it to be a constant, although 
in a more elaborate theory it could depend on iff in some way. Its magnitude is 
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unknown at present, but we may be able to measure or infer it in the future, as we 
gain more experience with the VP. 

A\ and A 2 are in general integrals over all of spacetime, but the variational 
principle can still be useful without solving for the entire history of the universe. 
This is because an experiment can usually be considered to be localized within 
some region of spacetime, meaning that states and events within do not in- 
teract with the exterior (complement) of ^.0 Since we expect the system to evolve 
according to the SQM wave equation except when a measurement is being made, 
the spacetime regions surrounding those measurements are islands surrounded by 
regions in which the SQM wave equation is satisfied and the integrands in A\ and 
A2 take their minimum values. We conjecture that the minimization problem for 
all spacetime then reduces to the problem of minimizing A\ + eA2 over each such 
"island," and that we can do so for each island independently of all the others. As 
a result, the global variational principle (13 .5 1 can be reinterpreted to apply to a sin- 
gle island. We conjecture that in a properly designed experiment, the experimental 
domain includes all of one (or more) island(s), that is, that its boundaries in- 
clude only spacetime points where the SQM wave equation is satisfied (to a degree 
of accuracy commensurate with the requirements of the experiment). 

Therefore we will write A\ and A2 as integrals over all of spacetime, with the 
understanding that it is generally permissible to limit the domains of integration 
to some bounded region 



3.2 A 1 term — preference for solutions of the SQM wave equation 

We will take as the first term in the VP 

A 1 = ((#^» 1 . (3.6) 

Here our notation 1 signifies the expectation value of an operator G that 

depends on a single spacetime coordinate x^ per particle. If we are dealing with 
single -particle states iff, this is defined in the usual way: 

_ Jd 4 x^(x)ff(x) ¥ (x) 
{{a)h - Jd*x ¥ T(x) ¥ (x) ■ iiJ) 

The double triangular brackets are to distinguish this notation from the three- 
dimensional matrix element 



77), = Jd 3 x v^ t (f,x)^(f,x)T](f,x) : 



which will be useful to us later on. (A useful mnemonic is that a single pair of 
brackets stands for an average over space alone, while a two pairs signify aver- 
aging over both space and time.) Then the one-point expectation could be written 

as 

fdt( V \ff\v) t 
U )H fdt( ¥ \l\Y)t ' 

3 This is of course an idealization, since the experimenter must interact with the experiment 
to set it up and to read out the measurement. The implications of this for the application of the 
VP and the definition of 3?, are a matter for further study. 
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We see that as expected, this form of A\ penalizes deviations from the Dirac equa- 
tion; solutions of that equation trivially give A \ its minimum value of zero. In fact, 
there is no other way to get A\ = 0, so the SQM wave equation @\jf = is both a 
sufficient and a necessary condition for Ai to vanish. 

We believe that it should be possible to construct analogous VPs for other 
SQM equations, such as the Klein-Gordon equation, by making appropriate sub- 
stitutions for the operator $! in (13.61 ), but as mentioned before, this paper is limited 
to fermions. 



3.3 Small-perturbation case 

It will be useful to write the wavefunction as a superposition of modes, and be- 
cause we will need to understand how the superposition evolves in time, we will 
now consider "small-perturbation" limitations under which such modes can be 
defined and followed in time. These limitations will help us understand the VP, 
but we do not mean to imply that its validity is limited to this case (or to a fixed 
reference frame, in which we will do the following analysis). 

Later in the paper, we will consider fidelity of our proposed quantum me- 
chanical principle to the Born rule. Since experimental tests of the Born rule 
must relate the measurement outcomes to the initial structure of the system un- 
der study, the measurement must be made in a way that perturbs the system only 
slightly. A large perturbation would ruin that relationship — but (generally speak- 
ing) a perturbation-free measurement is impossible. 

We will find it useful to write the Dirac equation in the form 



where the Dirac Hamiltonian H is defined so as to make (13.8b equivalent to (13.1b : 



Now the state energies will be important to our analysis of zitterbewegung, so we 
would like to express them as the eigenvalues of the Hamiltonian. Thus we will 
supplement the Dirac equation (13.8b with a Schrodinger-like eigenvalue equation 



and look for eigenstates that solve both equations. 

Since any experiment is carried out by manipulating the Hamiltonian, the latter 
must be a function of T, via the electric and magnetic potentials <P and A, but 
not otherwise on t, because it contains only spatial derivatives. (Recall that the 
experimenter is ignorant of Therefore T can be held fixed and the equation 
solved as a function of the spatial coordinates. As a result, for any value of T, the 
Hilbert space is spanned by a basis of eigenfunctions %j(T,X): 




(3.8) 



H = y {y-n + m)+eA". 



H\jf = E\jf 



(3.9) 



H(T,x)Xj(r,x)=Ej(T)Xj(T,x) 
satisfying the orthonormality relation in 3-space 




(3.10) 
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Let us suppose that H varies continuously with T. Therefore we expect that for 
any j, Ej(z) and ^ ; (t,x) are also continuous functions of time T, except possibly 
for a set of measure zero of values of t; let us assume that the experiment is 
designed so that those special cases are not encountered. Put another way, we 
number the eigenstates + 8z,x) in a way consistent with our numbering of 
them at t, so that 

lim x,-(t + 5t,x) = Xj{r,x). 

So a given state (choice of j) retains its identity as time evolves. 

Now we are in a position to be more precise in our statements that the pertur- 
bations in the Hamiltonian are small. Since each state is normalized to unity [Eq. 
(I3.10H . we require the rates of change to be small, that is, 



f d 
J d 3 XX'l(T,x)— £;(T,X)<. 



(3.11) 



for any choices of j, k and T. 

We have constructed the basis set {%j } as solutions of the eigenvalue equation 
(13.9b , but not the Dirac equation ( 13.8b . To get a basis that satisfies that equation as 
well, we note that any solution %; of the eigenvalue equation will still be a solution 
if it is multiplied by a function of t. Therefore 



Wj{t,x;ti) =Xj(t-ti,x)exp 



i J Ej(t'-ti)dt' 



(3.12) 



is a solution of the eigenvalue equation. (Note that we lose no generality by choos- 
ing t' = as the lower limit of integration, since the choice of the origin of time 
has not been, and will not be, otherwise constrained; note also that that equation 
is acceptable for all real values of ?, not just t > 0.) Our notation signifies that iff 
is the function of t and x that results from starting the experiment at time The 
dependence on /; will become important later on, but for the time being we will 
omit it from our notation for iff. Then 



d d Y ■ 

/— [xffj {t,x)] = Ej (t - ti)yfj (t,x) + i-^± 



exp 

T=r— fj 



-i f Ej(t' -ti)dt' 
Jo 



Now by the assumptions laid out above, the second term on the right-hand side is 
negligible compared with the first term; we can see this by operating on those two 
terms from the left with the operator 



I d 3 xxl(T,x) 



for all possible basis states Xfo using ( 13.111) , and remembering that for nonrela- 
tivistic potentials, the Dirac equation gives \E\ w m. Then since iff satisfies the 
eigenvalue equation, we can write 



ij- t [y'j(t,x)]=H{t-t i ,x)\irj(t,x), 



so the states {y/} } satisfy the Dirac equation as well as the eigenvalue equation. 
This property will allow us to easily switch between the eigenvalue equation, 
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which we will use in our analysis of the Born rule, and the relativistic expres- 
sions and equations that appear in the covariant form of the theory. 

Note also that equation ( 13.121) almost completely separates the space and time 
dependence of Xjfj, since %j depends on position but only slightly on time (via t). 
Because of our earlier stipulations on small and slow perturbations of the Hamil- 
tonian, %j ls to leading order a function of position alone. However, the fact that 
the relative change of Ej is also very small does not mean that it can be ignored in 
the exponential, as we have no reason to presume that 



/' [Ejit'-tJ-EjiO)] dt'<^\. 
Jtn 



Now the set { V}'(0} we a complete basis satisfying 

®\l/j = (3.13) 

and 

J d 3 x\j/ f j(t,x)\i/ k (t,x) = 8 jk ; (3.14) 
we can therefore expand a general wavefunction iff as 

xi/(t,x)=^Cj(t)xi/j(t,x). (3.15) 

j 

3.4 A i term — penalty for rapid evolution of the wavefunction 

The second property of A\ is easily understood if we expand the wavefunction 
in terms of eigenfunctions of the SQM (Dirac) operator as in (13.15b . Then from 

(Em 

®V=— ECjvO, (3-16) 
and it follows immediately that 

MI/|c'(f)| 2 

m z J at 

Thus A i penalizes rapid changes in the coefficients {C,}; for instance, instanta- 
neous collapse (\C'j(t)\<x 8(t)) would make an infinite contribution to the integral 
in the numerator. This establishes the second property of A\. 
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3.5 A i term — enforcement of the Born rule 

In this subsection we must be clear about those quantities that depend on so 
we will revert to the notation \jfj(t,x;ti) introduced in (12.2b and (13.12b . Then the 
normal-mode expansion of iff in (13.15b must be understood as 

Y(t,x;t i ) = "£Cj(t;t i )\ifj(t,x;t i ), 

j 

reflecting the fact that the evolution of the system, as expressed by the coefficients 
Cj in the expansion, also depends on [We will henceforth write the derivative Cj 
in identity ( 13.16b as a partial derivative.] Then the proportion ("branching ratio") 
of decays of a wavefunction given by ( 13.151) to the single state Xffj must be the 
initial weight 

Yj = \Cj{tut,)\ 2 (3.18) 

of that state. 

We wish to study the statistics of the experimental outcomes as the starting 
time is varied. If all modes present had the same frequency, we could average 
the starting time ?; over one period of oscillation. Since nontrivial systems will 
have multiple frequencies, that average is complicated, and we choose instead to 
average fj over an interval long compared to the periods of all the modes present. 
To denote the average on f;, we will use an overbar and drop the argument f;, thus 
for an arbitrary function F, 



1 f 1 

F(t,a) = lim — / dtiF(t,a;ti) 

7-->oo J Jt-T 



where the optional argument a stands for any set of independent variables besides 
f and fj. 

Now we expect that the term A2 in the VP will cause the system to decay to a 
single state, that is, 

for some k (which, we presume, depends on the choice of ?;). Then, in a large 
number of "identically prepared" experiments, the fraction of outcomes in state j 
is just the average over values of f; of the left-hand side of ( 13.19b . Then the Born 
rule can be expressed precisel)Q as 



Yj= lim|C y (f)| 2 . (3.20) 



This is the "branching ratio" property that our theory must reproduce. 
Because {^/(O) are a complete set, we can write 

(Y\ &9\y) t = £ (Y\ # I Wj) t (Vj\ ®W)t 
j 

= LN 2 



J 



4 In the general case, wher e the decay is to a set of degenerate states, we replace |C ; | 2 every- 
where it appears in equation ( 13,20) by the sum of that quantity over the degenerate set. 
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where 

Tj(r>ti) = (Yj\@\Y)f 

Thus the minimization of A\ tends to make \Tj\ as small as possible for every 
energy eigenstate Xjfj, at every time t. 

If A\ were the only term in the variational principle, it would attain a value 
of zero by making every T; vanish. But it is in competition with A2, which is 
trying to cause a superposition of modes to decay to a single mode, which requires 
some coefficients C, to have nonzero time derivatives, which [as we see from 
(13.17b ! prevents A 1 from going all the way to zero. So we will have to analyze that 
competition to determine how small the TVs will be. That is beyond the scope of 
this paper. Instead, let us study the effect of allowing one T; to vanish completely; 
this should approximate the effect of its being small but nonzero. 

We therefore set 

7) = (^|iF|v> ( =0 (3.21) 
for any j. Then we can use identity ( 13.16b to write 

dt 



k 



= I (\Cj I 2 ) < Yj I 7° I Yj) t + I q ^ < YJ I 7° I Yk), (3-22) 

Further progress depends upon understanding the matrix elements \ yfj\ Y°\Yk) r 
If the form (13.2b is used in the nonrelativistic case, it is easily seen from the wave- 
functions ( 13.3b and ( 13.4b and the orthonormality relation (13.14b that if Ej and 
have the same sign, 



{Yj\y°\Yk) t = 8 jk sga Ej + o(£) 



If they have different signs, so that \Ej - E k \ « 2m, then y° \ Yk)t i s ^(m) 
and, by Eq. ( 13.12b . has the value 



f AE jk {t' -h)dt' 
Jo 



(VOi 7 \Yk) t = (Xj\y \Xk) t -n ex P 

in which we have introduced the shorthand 

AE jk {T)=Ej(T)-E k (T). 

This is a rapidly oscillating function of t (with an angular frequency of the order 
of 2m), to wit, zitterbewegung. 

So, keeping terms to 0{p/m) in (13.22b . we see that 



•>l 2 ) 



sgn£; £ q(t;t i )^^{xj\y \Xk) t ^^p\i^AEj k (t'-t i )dt' 



EjE k <0 



(3.23) 
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Thus \Cj\ 2 is governed by a sum of zitterbewegung terms, causing it to vary on a 
very short timescale and in a way that depends on We will ensure by the action 
of A2 that the wavefunction will decay to a single state [Eq. ( 13.191 )1, but it appears 
that the zitterbewegung terms in (13.23t will determine in which state it ends up, 
and the different possible outcomes will arise from different choices of t\. (We 
will soon demonstrate that this is true.) This is the "hidden variable" behavior we 
predicted. 

Note also that the terms determining the evolution of |C/| 2 are of the order of 
the momentum p of the individual eigenstates. This linear dependence on momen- 
tum is consistent with our earlier argument about how the zitterbewegung terms 
must scale in order to determine the outcome of the wavefunction decay. 

Now we integrate the last equation and use the initial weights ( 13.18b : 



\Cj(t- tl )\ =Yj- 



d_ 

dt' 



Cj(t'; tl )\ 2 ) 



dt' 



(3.24) 



With a little effort we find that 



\Cj(f,ti)\ « Yj + isgnEj 

< y { cut-jyj^i( Xj \f\ Xk ) t _ ti 

K 1 AE jk (t- ti ) CXP 



EjE k <0 



C*^jQ d -^\ t = ti (xj\Y \X k )o 
AE jk {0) 



exp 



ij AE jk (t' ~h)dt' 



i AE jk (t' -ti)dt' 
Jo 



(3.25) 



(We give mathematical details in the appendix.) 

Next we need to take the t — > °° limit. But we know that due to ( 13.191 ), the 
partial derivative dC k /dt goes to zero in that limit, and the other factors in the first 
term in curly brackets are bounded, so that term drops out: 



lim|C/ » Yj 



1 s 



„ v q(.,;.,)^a)|,., l fe|y°|a) l , 

~ spi exp 

EjE k <0 



AE jk (t' -tijdt' 



Finally, we observe that none of the factors in the numerator and denominator of 
the fraction (with the possible exception of the partial derivative) actually depend 
on the start time for instance, Cj(ti,t\) is one of the quantities that would be set 
by the experimental design. Then when we average on the complex exponential 
averages to zero (to good approximation). The result is the Born rule (13.20b . which 
is what we set out to prove. 

Since the statistical distribution of outcomes (and thus all possible outcomes) 
resulted from an average over experimental start times t\, it follows that the dif- 
ferent outcomes were determined by the choice of f;. This establishes the role of 
the start time of the experiment relative to the oscillations of the wavefunction 
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(or equivalently, the wavefunction's phase at a given time relative to the experi- 
ment) as the necessary hidden variable that determines the outcome. Our deriva- 
tion shows that zitterbewegung is the mechanism by which the choice of ?; deter- 
mines the outcome. 

Note that this derivation is approximate. The supposition in ( 13.21b that Tj is 
exactly zero is not accurate; as we pointed out, its size will actually depend on how 
well the A2 term fares in its competition with A]. Also, the analysis after equation 
13 .2 II includes some approximations. Therefore the appropriate conclusion is that 
the Bom rule is an approximate law. This suggests that natural phenomena may 
disagree with the Bom rule under some circumstances, and by some (presumably 
small) amounts, that could be predicted by our theory. This may constitute an 
opportunity for a feasible experimental test of the VP. (See the last paragraph in 
subsection 12. II ) 



3.6 A 2 term — wavefunction collapse 

In order to construct A2 we begin by constructing an expression for (squared) 
position uncertainty Sx 2 . It would seem natural to define it by 

8x 2 = «| Xl -x 2 | 2 » 2 , (3.26) 

where the expectation of a two-coordinate operator ((^(xi,X2)))2 is defined by 
analogy to We need to modify this definition in two ways. First, we re- 

place the operator within the angle brackets by the relativistically covariant expres- 
sion — (x^ — *2 )(xifi —X2[i). (We will use the summation convention for repeated 
Greek indices, which run from to 3.) 

The second modification is motivated by our intention that the (squared) wave- 
function uncertainty 8x 2 5p 2 should be meaningful at, or at least near, a given 
instant in time, so our expression for it must couple positions and momenta that 
are "at the same time" in some sense. Since the property of simultaneity depends 
on choice of reference frame, we will instead require that the spacetime locations 
sampled in computing 8x 2 8p 2 be spacelike separated. Therefore we will include 
within integrands some function f{x\ —X2) that vanishes whenever x\ and X2 are 
timelike separated. The simplest choice of / is of course 

/(z) = w(-z%) (3.27) 

where u is the unit step (Heaviside) function, but many other forms are possible. 

With these changes, we define the expectation of a two-coordinate one-particle 
operator as 

, > v = Jd 4 xid 4 x 2 \jf' ] '(xi)\if^(x 2 ) 2 (xux 2 ) \jf{x l )\jf(x 2 )f(xi -x 2 ) 
2)h ~ Jd^d A x 2 ^{x l )x i f\x 2 )xif{x i )xif{x 1 )f{x l -x 1 ) ] 



5 Note that the expression in d3.26t is twice another common expression for the squared posi- 
tion uncertainty: ((|x, -x 2 | 2 }} 2 = \ 2 )) l - 2((x, -x 2 » 2 + «|x 2 | 2 )) , = {{N 2 })! -2((x 1 » r 
((X2» 1 + {{|x 2 | 2 )) 1 =2[«|x,i 2 }} 1 -(«x 1 }} l ) 2 ] =2«|x,-((x 1 }} 1 | 2 )) 1 . However, we are 
about to redefine {{ff(xi,X2))) 2 in a wa y that will prevent ((xi -x 2 }} 2 from being factored in 
this way except in particular special cases, so our uncertainty expression will in general not be 
simply related to {{|xi — ( ( x i ) ) 1 1 2 ) ) 1 - 
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In this definition, the subscript 2 attached to & is a reminder that it depends on 
two arguments; the subscript 2 on the triangular brackets is to distinguish the ex- 
pectation formula defined here, which applies to two-argument operators, from 
expectations defined in ( 13.7b and in other definitions below. 

We are familiar with the utility in SQM of the three-dimensional inner product 
integral 

Jd 3 x\lfl(t,x)\lf 2 (t,x), (3.29) 

which is central to, for instance, orthonormality relations needed for the construc- 
tion and use of basis sets. We would like the relativistic theory we are develop- 
ing to have similar properties in appropriate limiting cases. In particular, we note 
that if the products y/ 1 " (xi ) (x 2 ) W( x i ) W{ x i) ar, d W f i x \ ) ( x 2) @{ x \ , x i) W( x i ) 
V(x2) are constant in time, then the temporal integrations on x® in ( 13.28b can be 
performed first. If we choose the unit step function (13.271 for /, the inner integral 
is 



J dx\f(x x -x 2 ) =2|xi-x 2 | 



expressing the fact that at the location x 2 , the time interval Ax® over which (x® , x 2 ) 
is spacelike separated from (x®,xi) is proportional to the spatial separation |xi — 
x 2 |. As we prefer not to give greater weight to greater separation distances in 
expectation calculations such as ( 13.281) , we might take instead of ( 13.271) the form 

m^ 1 ^- (3-30) 

Then 

r dxif{x x -x 2 ) = l, (3.31) 



so the numerator and denominator in (13.281 factor into products of orthogonality 
integrals like ( 13.291 1 — a useful property, as we will see in due time. Unfortunately, 
this form of / is not relativistically covariant, but if we choose instead 



we find that ( 13.31b is still satisfied. 

We might in this way define position uncertainty as 

8x 2 ee -*£)(*!„ -x 2M )» 2 (3.32) 

and momentum uncertainty as 

Sp 2 =((-[p^(x l )-p^(x 2 )] M*i)-^M)> 2 , (3.33) 

and define A 2 as the product of 8x 2 and 8p 2 , but that would give cross terms 
composed of non-conjugate variable pairs, such as 8y 2 8p 2 . Instead, let us use 
the combination (Sx^ 8p^) 2 within the angle brackets. It will also give unwanted 
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cross terms (e.g., 8x^ 1 8p^ = \8x v 2 5p v= 2), but they will not be quadratic and 
should therefore not contribute to expectation values. This leads us to define 

5x 2 8p 2 = {« -4) [pi^x,) - p^(x 4 )}} 2 (3.34) 

[where the LHS is simply the notation for a new operator, and not the product of 
(f332T> and d3~33l) l and 

A 2 = ((8x 2 8p 2 )) 4 , (3.35) 

in which ((^4))4 is an extension of the expectation defined in ( 13.28b to operators 
depending on four spacetime points: 

«^4)>4 = 

d 4 x\ d 4 X2d 4 X3 d 4 X4 ^{x\) iff f (x2) yH x 3) w\ x ^) &a{x\ j • ■ -m) 



y{x{)y{xi)y{x?)y(x4)f{{xk-xi :i<k<l<4}) 
j d 4 x\ d 4 x 2 d 4 x 3 d 4 X4 y\x\) ^(^2) V^( x 3) ¥ f ( x 4) 

y(xx)y{x2)y{x-$)\\r{x A )f[{x k -xi :l<k<l<4}) 



(3.36) 



and the function /({x^ — Xi : 1 < k < I < 4} ) enforces the spacelike separation of all 
four points: 

f( v y Y Y r v ^ - ntlllLt+l « -^*)(^ ~^)] n ^ 

/(Xl -X2,Xi -X3,...X3-^4j = — ^ - . (3.3/) 

Vr (xi — X2,x\ — X3, . . .X3 — X4) 

Here the weight function W must be chosen so that / satisfies the four-point ex- 
tension of (13.31b : 

j dx\ J dx® j dx\ f(x\ — X2,X\ — X3,X\ — X 4 ,X 2 — Xt,,X 2 ~ X4,Xt, — X4) = 1 . 

(3.38) 

As we saw in (13.30b for the expectation of a two-point operator, a trivial solution 



is 



W(x\ —X2,X\ — X3, . . .Xj, —X4) = 

3 4 



dxl \ dx\ j dx\ Y[ n u [-( x k -X/ 1 )^-*'^] ' 



k=\ l=k+l 



which is unfortunately not covariant because it is a function of |xi — X2I, |xj — 
X3 1 , . . . X3 — X4 but not jt"P —X2,x9—Xj, . . .X3 — JC4. We conjecture that a covari- 
ant weight function W satisfying d3 .3 8 b exists, and will proceed to use it without 
determining its form. 
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3.7 Af-particle version of the variational principle 

When no measurement is being performed, an isolated system obeys the usual 
SQM wave equation. On the other hand, if a measurement is being made, M in- 
cludes the wavefunctions of both the system and (some part of) the measuring 
apparatus, for the reason given in the discussion of A 2 in subsection l3.ll Then we 
must generalize the variational principle (13. 5t to describe the set of all the particles 
in ii". Let those particles be labeled with the subscript n, where for instance n = 1 
might be the "system" being measured, and n > 1 are particles of the apparatus. 
Suppose for simplicity that all N particles are distinguishable and have spin j. 
Then we generalize equations (13.6b and ( 13.7b to 

N 

Ai = £«^„» 1 (3.39) 

n=l 

and 



1 = 

I {U„d 4 x„) \if^(x u x 2 ,x 3 ,. . . ,X N ) &(x U X 2 ,X 3 , . . . ,x N ) \ff(xi ,x 2 ,x 3 ,. . . ,X N ) 
I(Hnd 4 X n ) \(f''>(x l ,X2,X3,...,X N )llf(x l ,X2,X3,...,X N ) 

(3.40) 

For operators depending on four points per particle, we generalize equation (13.35b 
to 

A 2 = ^I i {(<i-<2)[P«3 M (^)-/?„4 M (^4)]} 2 ^ , (3-41) 



in which the four-point expectation (( ))4 is defined as 

«^4» 4 = 



/ II O^ 4 ^ Y\v\x\k-,---XNk) 0{xn,xn,xv$,xu,xn t ... t x m ) 

■I \ n =\ k=l J \k=l ) 



'A \ N 

Y[v(xik,---X N k) Y\f{{Xnk-Xnl ■ 1<^<^<4}) 
K k=\ ) n=\ 



I (nrK^) ( X\\v{x\k,.--xm)?\ X\f{{xnk-x n r-\<k<l<^}) 

J \n=\ k=l J \k=l / n=l 



-I 



(3.42) 



in which the notation x n i, ■■ -x„4 signifies four different spacetime coordinates for 
particle n, and / has the form (13.37b . 

The A^-particle version of the VP has recently been applied by this author 1301 
to the electron two-slit experiment [31 32], including Wheeler's delayed-choice 
variant. l33l 
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3.8 Comparison of the VP to the design constraints 

The VP we have constructed satisfies most or all of the constraints listed in subsec- 
tion |2T| Nothing in the theory prevents us from understanding it as a description 
of matter waves themselves, so the first property is satisfied. Since the VP is the 
sum of terms providing for both state reduction, when that is called for, and behav- 
ior consistent with the SQM wave equation otherwise, it has properties 2-4. We 
have shown that it approximately satisfies the Born rule, and conjecture that the 
approximation is good enough that the experimental record does not contradict it; 
this is just the fifth constraint. That analysis also confirmed that the phase of the 
wavefunction, as exhibited in the phenomenon of zitterbewegung, plays the role 
of the hidden variable, as predicted by constraint 6. 

As we will discuss in subsection 15.31 we are unable to say whether the new 
theory is deterministic, which was property 7. However, the form of the integrals 
makes it clear that properties 8-12 are satisfied. We expect property 13 to hold as 
well; certainly we have not constructed different forms of the theory for micro- 
scopic and macroscopic domains. 



4 Feasibility of experimental tests of the theory 

It seems plausible that this theory could be tested experimentally. One promising 
avenue is the decay process, as opposed to the collapse favored by SQM. We have 
noted that SQM embargoes any information derived from a measurement in less 
than the time At ~ l/AE; but we have not found such a limitation necessary in 
our theory. It would be interesting to make measurements within that time interval 
to see if the decay process could be detected. 

Another possibility is to look for correlations between events close together in 
time. If a system really evolves deterministically, depending on the hidden variable 
fj, then the correlation between two measurements made in rapid succession on 
the same system may show evidence of that. However, the interesting content of 
the correlation function may decay or oscillate on a timescale comparable to the 
time Tjk given by ( 12.3b , so the required timing precision may be unattainable with 
current technology. 

We have also admitted that our derivation of the Born rule includes some ap- 
proximations. If in fact the theory satisfies that rule only approximately, then the 
deviations from the exact rule constitute predictions that could be tested experi- 
mentally, as we suggested at the end of subsection 13. 5 1 



5 New perspectives 

Quantum mechanics has challenged physicists' intuition since its inception, be- 
cause it is understood to operate in ways unlike any other physical theory. Al- 
though we continue to embrace many such ideas, such as the intrinsic nonlocality 
of nature, we have attempted to overturn some that we considered particularly 
objectionable, such as the special but ill-defined treatment of measurements. We 
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have certainly not succeeded in rewriting quantum mechanics in an orderly, con- 
ventional form like, for instance, classical electrodynamics or even special relativ- 
ity (nor did we expect to). In fact, at this stage of our understanding, we appear to 
have introduced some new enigmas. Nevertheless, we have found a natural way to 
assimilate both nonlocality and the measurement process into a variational prin- 
ciple that reduces to the Dirac equation under appropriate conditions. Although 
there are still important mysteries about how to put the pieces together, the ele- 
ments of our variational principle appear to encapsulate the essentials of a theory 
that may have advantages compared to those that have been explored and accepted 
up to now. 

We shall at this point take note of the perspective this theory gives on some 
well-known issues. 



5.1 The "uncertainty principle" 

We will observe here that (at least) two different ideas are commonly referred to 
as "the (Heisenberg) uncertainty principle." One is the idea of complementarity, 
that there are pairs of "complementary variables" for which the product of the two 
uncertainties has a minimum value of the order of h. Since we are taking h = 1, 
those inequalities 

AxAp>\, AtAE > 1 

are equivalent to mathematical relations familiar (in an order-of-magnitude sense) 
from Fourier analysis: 

AxAk > 1 , At Am > 1 . 

Clearly complementarity is a valid principle, expressing fundamental mathemati- 
cal properties of waves. 

On the other hand, the term "uncertainty principle" is also often used to refer to 
the idea that the measurement process is intrinsically random, so the outcome of a 
measurement is uncertain. Our theory rejects that uncertainty principle, proposing 
instead that the result of a measurement depends on the phase of the wavefunction. 

5.2 Time-reversal invariance 

We observed earlier that the collapse process in SQM is time-reversal-dependent, 
as it converts multiple states in the past into a single state in the future. We note 
that our variational principle, like other fundamental laws, has no preference for 
either direction of time. 

The time reversal dependence of SQM is actually more subtle than is often 
appreciated. The state of a system before a measurement may be a superposition 
only in terms of the operator, or set of physical fields, that will be imposed to make 
the measurement. For instance, an electron with spin | in the z direction exists in 
a single eigenstate of the operator S z . If we then consider its spin in the x direction, 
we describe it as being in a superposition of two spin states, ±j. But it has not 
changed its state; we have simply chosen a different basis set in which to describe 
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it. After the measurement, when it is in a single eigenstate of the S x operator, we 
could equally well describe it as in a superposition of two eigenstates of S z . So 
this case is really time-reversal invariant, even according to SQM. 

This idea stands out more clearly when we compare the perspectives of two 
observers "traveling" in opposite directions through time. Suppose that at t = 
we measure S x by quickly switching the alignment of an imposed magnetic field 
from the z to the x direction. An observer traveling "forward" in time (that is, in 
the direction we sense as forward) would say that the single state with S z = \, 
understood as two states with S x = ±A, collapsed at f = to a single state, say 
S x = \. An observer coming from our future toward our past would observe the 
single state with S x = \ and note that at t = the magnetic field changes from the 
x to the z direction. He would conclude that we had measured the z component of 
its spin, causing the two states S z = ±j to collapse to the single one with S z = \. 
Therefore both observers would succeed in interpreting events as consistent with 
SQM. For this thought experiment, the predictions of SQM are actually symmetric 
in time. 

The predictions of our theory are similar in this case, except that the transition 
of the wavefunction from an eigenstate of S z to an eigenstate of S x occurs smoothly 
around t = 0. Since the variational principle has no sensitivity to the direction of 
time, that transition is presumably symmetric about / = 0. This is an example of 
the nonlocality in time referred to earlier. Because the transition begins before the 
instant at which the field alignment changed, this is a violation of causality in the 
usual sense, although others [34 35 1 have pointed out that quantum mechanics can 
violate causality. (The thought experiment does have a flaw, however, in that the 
switching of the field alignment cannot really be instantaneous.) 

Now let us modify that thought experiment somewhat. Suppose that at t = 
we turn off the field in the z direction, and then cause the x-aligned field to 
appear at t = 1 . Our two observers (both trained in SQM) will draw conclusions 
consistent with SQM as before. However, their conclusions will be inconsistent 
with each other, because the first observer will maintain that the system stayed 
in the state with S z = \ until we measured it at t = 1. The second observer will 
regard the appearance (from his perspective) of the z-aligned field at t = as 
our measurement, so he will conclude that for < t < 1 the electron was still in 
the S x = j state that he had observed at times t > 1 . This is a case in which the 
predictions of SQM vary with the direction of time. 

On the other hand, our theory will maintain time-reversal invariance. The evo- 
lution of the wavefunction during the period of interest will be determined by 
minimizing the integral (over time and space) of the appropriate functional, as has 
been described. Whatever the state of the electron between and 1, it will be the 
same for both observers. 



5.3 Is the new theory deterministic? 

We hoped to produce a deterministic theory; in principle, that should be possible, 
because we have a hidden variable. However, it is not yet clear to us how nature 
might solve the variational principle. The wavefunction over an entire region of 
spacetime is available to be varied. (Actually, all of spacetime could theoretically 
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be involved. But any properly conducted experiment must be isolated from un- 
wanted influences, so there must be spatial bounds on the region that must be 
considered. In addition, there must be a start time at which the state of the system 
is well-defined and a stop time at which the results are unambiguous, so the rel- 
evant region of spacetime is completely bounded.) Does nature search the entire 
available phase space and find the solution which gives an absolute minimum in 
the variational principle? Or is a local minimum sufficient? If a local minimum is 
sought, how is the search conducted? If multiple local minima are available (or 
nearby, if nearness in phase space is relevant), how is a single one chosen? 

Until we understand more of these issues, we cannot say whether the theory is 
deterministic. 



6 Summary and conclusions 

SQM explains nature in terms of a wave equation which (whether the Schrodinger, 
Dirac, or Klein-Gordon equation) is linear and local, in spite of the fact that nature 
is clearly nonlinear and nonlocal. The wave equation is supplemented by a "col- 
lapse" process that is nonlinear, nonlocal and is usually understood to be time- 
reversal-dependent. We are suspicious of the adequacy of the wave equation and 
the plausibility of the collapse process, and the criteria that determine which of 
the two processes governs at any given instant seem to us to range from vague 
to unbelievable. Accordingly, we have set out to construct a unified theory that 
explains both types of phenomena in a natural way. 

In order to force the new theory to resemble the SQM wave equation where 
possible but allow it to make transitions (decays) where it must, we have formu- 
lated it as a variational principle. We have found combinations of integrals over 
space and time that seem to have the desired properties, and also provide the re- 
quired nonlocality and nonlinearity in a plausible way. One of the terms in the 
variational principle forces superpositions of states to decay to a single state. The 
other term prevents unwanted discontinuities in time, provides for the theory to 
satisfy the Born rule for the distribution of measurement outcomes, and vanishes 
for Dirac solutions under the right set of circumstances. The theory includes as 
a hidden variable the phase of the wavefunction, which is manifested physically 
via beats between modes of different energies, particularly the zitterbewegung be- 
tween positive- and negative-energy modes. 

We are hopeful that the new theory may be testable with currently available 
technology. 

We note that in addition to eliminating the weaknesses of SQM with regard 
to wavefunction collapse, measurement theory, and nonlocality, the new theory 
repudiates the intrinsic randomness of nature as understood in SQM. Also, it is 
completely time-reversal invariant. We suspect that it may be deterministic, but 
must better understand the application of the variational principle in nature before 
we can be certain of that. 
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APPENDIX: Evaluating the integral in equation ( 13.24b 

We will perform the integration in Eq. ( 13.241 . for which the integrand is given by Eq. d3.23t . 
Since we are considering a nonrelativistic case (that is, one in which all the modal energies are 
close to ±m), the energy difference AEj^t) do es not vary much from ±2m. Then the value of 
the integral of that quantity in the exponential in J3,23> is very close to ±2mt. This suggests that 
we could profitably make a change of variable from t to 

w(t;h)= ( AE jk (t'-ti)dt'. 
Jo 

We also observe that the exponential in J3,23t varies much more rapidly than the other factors 
in the summand. Let us therefore define the function F(w;t l ) by 

AEj k (t-tOF[w(tU i yA]=q(tU i )^^(^\7 

and n ote tha t F varies slowly as a function of its first argument (compared to the exponential). 
Then ( 13 .23 1 becomes 

?- {\Cj(t;ti)\ 2 )=-sgnEj £ AE jk (t -tOF^t^e^ . 
Hi 

EjE k <0 



Now we substitute this into 43.24b : 

|C,(r;/j)| 2 = F,-sgn£, £ / AE jk (t' -t { )F[wif';h);h\ e m ' ( '' ; ''> dt' . 
Hi J '' 

EjE t <0 

But 

dw 

-^■=AE jl (t-t i ) 
so we can change the variable of integration from t' to w, thus: 

\Ci(t-t t )\ 2 = Yi-sgnEj £ P ( '' h) F(yrA)Pdw. 

k^j -Ml*) 
EjE t <0 

Now we integrate by pails. 



\Ci(fA)\ 2 = Yj - sgn£, £ 

Hi 

EjE k <0 

But because F is slowly varying, the second term within the brackets (the integral) is small 
compared to the first, therefore, 

\Cj(t; tl )\ 2 *Yj + i S gnEj £ j^M^-'WttM^}. 

Hi 
EjE k <a 



Then we substitute the definitions of F and w into this equation to get J3.251 . 
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